Virtual Parallelism allows Relaxing the Synchronization Constraints of SIMD Computing Paradigm
نویسندگان
چکیده
In this paper we propose to introduce execution autonomy in the SIMD paradigm to overcome its rigidity while preserving the advantages of its synchronous programming model and we show that Virtual Parallelism support is a necessary condition to the profitable application of execution autonomy. We define execution autonomy as the capability of each processing element of a massively parallel computer to execute the instructions in a block of code of a single common program autonomously and asynchronously. We define virtual parallelism as the capability to emulate a n processors array on a m processor array with n/m performance degradation. In past related works the relaxation of SIMD synchronization has been already proposed, nevertheless its relation with Virtual Parallelism has never been studied. keywords: SIMD, Computational Paradigms, Execution Autonomy, Virtual Parallelism, Data-parallelism.
منابع مشابه
Pure SIMD Processor Arrays with a Two-Dimensional Reconfigurable Network Do Not Support Virtual Parallelism♣
Abstract The support of virtual parallelism is important because it allows to consider the complexity measurements of the parallel algorithms valid in those implementations in which the size of the processor array is smaller than the problem size. In this paper we demonstrate that pure SIMD RPAs, i. e. with no addressing autonomy, that allow to establish tree-shaped two-dimensional buses do not...
متن کاملDesign and Implementation of the CCC Parallel Programming Language
CCC is a high-level parallel programming language that aims to provide a coherent integration of various parallel programming paradigms. CCC supports both data and task parallelism. In CCC, data parallelism is specified in SIMD model, while task parallelism is specified in MIMD model. Task parallelism in CCC supports both message-passing communication abstraction and shared-variables synchroniz...
متن کاملDLP + TLP Processors for the Next Generation of Media Workloads
Future media workloads will require about two levels of magnitude the performance achieved by current general purpose processors. High uni-threaded performance will be needed to accomplish real-time constraints together with huge computational throughput, as next generation of media workloads will be eminently multithreaded (MPEG4/MPEG-7). In order to fulfill the challenge of providing both goo...
متن کاملAn Efficient Massively Parallel Rasterization Scheme For a High Performance Graphics System
We present in this paper the IMOGENE II system, a massively parallel Multi-SIMD graphics system. This architecture uses a new rasterization scheme combining Object Parallelism and Parallel Virtual Buffers. This scheme leads to a better efficiency than other massively parallel SHvlD systems, and allows a cost-effective, powerful and easily expandabJe system to be designed. The syst.em consists o...
متن کاملSegmented Arithmetic Operators for Graphics Processing
Graphics processing relies on executing similar instructions repeatedly on a large data set. This parallelism in the data gives rise to the Single-Instruction Multiple-Data (SIMD) paradigm which is used in modern processors. This paper explores several techniques that exploit the parallelism in the SIMD execution functional units and proposes several new SIMD methods. The methods discussed in t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998